HIGH-FIDELITY DNA SEQUENCING USING SOLID PHASE 
CAPTURABLE DIDEOXYNUCLEOTIDES AND MASS SPECTROMETRY 



Background Of The Invention 

Throughout this application, various publications are
referenced in parentheses by author and year. Full
citations for these references may be found at the
end of the specification immediately preceding the
claims. The disclosures of these publications in
their entireties are hereby incorporated by reference
into this application to more fully describe the
state of the art to which this invention pertains.

The ability to sequence deoxyribonucleic acid (DNA)
accurately and rapidly is revolutionizing biology and
medicine. The massive Human Genome Project is driving
an exponential growth in the development of high
throughput genetic analysis technologies. This rapid
technological development involving chemistry,
engineering, biology, and computer science makes it
possible to move from studying single genes at a time
to analyzing and comparing entire genomes.


With the completion of the first entire human genome
sequence map, many areas in the genome that are
highly polymorphic in both exons and introns will be
known. The pharmacogenomics challenge is to
comprehensively identify the genes and functional
polymorphisms associated with the variability in drug



masked by the high fluorescence signal from excess 
dye-labeled primers or dye-labeled terminators, and 
are therefore difficult to identify.

Mass spectrometry is able to overcome the
difficulties (GC compressions and heterozygote
detection) typically encountered when using
capillary sequencing techniques. However, it is
unable to meet the read length and throughput
requirements for large scale sequencing projects. In
addition, poor resolution prevents the sequence
determination of large DNA fragments. At the present
time, the read lengths are insufficient for de novo
DNA sequencing and the stringent clean sample
requirements for using mass spectrometry for DNA
sequencing are not entirely met by existing
procedures. For this reason, most of the reported
mass spectrometry applications have focused on single
nucleotide polymorphism (SNP) detection. Several
methods have been explored to this end. The most
common approach is to extend a primer by a single
nucleotide and detect what was added. Another
technique developed by Tang et al. (1999) involves
immobilizing DNA templates on a chip and again
extending one base to determine a particular SNP.
The same group has explored the analysis of
restriction fragments to determine multiple SNPs at
once (Chiu et al. 2000). Each of these techniques
has been limited to analyzing only a few fragments at
a time due to current limitations in mass spectral
resolution. While these methods are sufficient for
determining a SNP at a particular base, they require
previous knowledge of the preceding sequence for
DNA sequencing fragments are not eliminated and are
introduced to the mass spectrometer. False stops
occur in sequencing when a deoxynucleotide rather than
a dideoxynucleotide terminates a sequencing fragment.
It has been shown that false stops and primers which
have dimerized can produce peaks in the mass spectra
that can mask the actual results, preventing accurate
base identification (Roskey et al. 1996).

The "lock and key" functionality of biotin and
streptavidin is often utilized in biological sample
preparation as a way to remove undesired impurities
(Langer et al. 1981). To date these methods have
involved attaching the biotin moiety on the 5' end of
the primer or the sequencing DNA template for capture
by streptavidin coated magnetic beads (Tong and Smith
1992, 1993). When the samples are purified in this
way, false stops and primers that can interfere with
the resulting sequencing data are not eliminated.

In addition, a further drawback of previous mass
spectrometry sequencing methods was the requirement
of four separate reactions, one for each
dideoxynucleotide terminator, analogous to the
approach used in dye-labeled primer sequencing.



Ideally, for sequencing with MALDI-TOF mass 
spectrometry, one would like to establish a procedure 
that allows sequencing reactions to be performed in 
one tube to simplify sample preparation, to use cycle 
sequencing to increase the yield of the DNA 
sequencing fragments, and to have a method that only 
isolates pure DNA sequencing fragments free from 
Biotinylated dideoxynucleotides and streptavidin 
coated magnetic beads can be used to generate high 
quality sequencing mass spectra of Sanger cycle 
sequencing DNA fragments on a MALDI-TOF mass 
spectrometer. The method disclosed here provides an 
efficient way to eliminate false stopped DNA 
fragments and excess primers and salts in one simple 
purification step, while still allowing the use of 
cycle sequencing to generate a high yield of 
sequencing fragments. Furthermore, it avoids the 
above-mentioned pitfalls of gel electrophoresis. 

The subject application discloses that mass-tagged 
dideoxynucleotides which are coupled with biotin or 
photocleavable biotin can increase the mass 
separation of the DNA sequencing fragments on the 
mass spectra, giving better resolution than 
previously achievable. 

Also, this application discloses a method for
creating streptavidin-coated porous channels that can
be used in light directed cleavage of the
biotin-streptavidin complex. This is important as present
commercially available streptavidin coated magnetic
beads are inadequate for photocleavage purposes, in
that they are opaque to ultraviolet light.

The system disclosed herein provides a high 
throughput and high fidelity DNA sequencing system 
for polymorphism and pharmacogenetics applications. 
Compared to gel electrophoresis sequencing, this 
system produces very high resolution of sequencing 
fragments and extremely fast separation in the time 
Summary Of The Invention 

This invention is directed to a method for sequencing
DNA by detecting the identity of a dideoxynucleotide
incorporated to the 3' end of a DNA sequencing
fragment using mass spectrometry, which comprises:

(a) attaching a chemical moiety via a linker to
    a dideoxynucleotide to produce a labeled
    dideoxynucleotide;
(b) terminating a DNA sequencing reaction with
    the labeled dideoxynucleotide to generate a
    labeled DNA sequencing fragment, wherein
    the DNA sequencing fragment has a 3' end
    and the chemical moiety is attached via the
    linker to the 3' end of the DNA sequencing
    fragment;
(c) capturing the labeled DNA sequencing
    fragment on a surface coated with a
    compound that specifically interacts with
    the chemical moiety attached via the linker
    to the DNA sequencing fragment, thereby
    capturing the DNA sequencing fragment;
(d) washing the surface to remove any non-bound
    component;
(e) freeing the DNA sequencing fragment from
    the surface; and
(f) analyzing the DNA sequencing fragment using
    mass spectrometry so as to sequence the
    DNA.
This invention provides a method for sequencing DNA
by detecting the identity of a plurality of
dideoxynucleotides incorporated to the 3' end of
linker to a 5-position of cytosine or thymine or to a
7-position of adenine or guanine.

The invention provides a system for separating a
chemical moiety from other components in a sample in
solution, which comprises:

(a) a channel coated with a compound that
    specifically interacts with the chemical
    moiety, wherein the channel comprises a
    plurality of ends;
(b) a plurality of wells each suitable for
    holding the sample;
(c) a connection between each end of the
    channel and a well; and
(d) a means for moving the sample through the
    channel between wells.

The invention provides a method of increasing mass
spectrometry resolution between different DNA
sequencing fragments, which comprises attaching
different linkers to different dideoxynucleotides
used to terminate a DNA sequencing reaction and
generate different DNA sequencing fragments, wherein
the different linkers increase mass separation
between the different DNA sequencing fragments,
thereby increasing mass spectrometry resolution.



ddTTP) can be used with any of the illustrated
linkers.

Figure 5: Synthesis scheme for mass tag linkers. For 
illustrative purposes, the linkers are labeled to 
correspond to the specific ddNTP with which they are 
shown coupled in Figures 4, 6, 8, 9 and 10. However, 
any of the three linkers can be used with any ddNTP. 

Figure 6: The synthesis of ddATP-Linker-II-11-Biotin.

Figure 7: DNA sequencing products are purified by a 
streptavidin coated porous silica surface. Only the 
biotinylated fragments are captured. These fragments 
are then cleaved by ultraviolet irradiation (hv) to 
release the captured fragments, leaving the biotin 
moiety still bound to the streptavidin.

Figure 8: Mechanism for the cleavage of 
photocleavable linkers. 



Figure 9: The structures of ddNTPs linked to
photocleavable (PC) biotin. Any of the four ddNTPs
(ddATP, ddCTP, ddGTP, ddTTP) can be used with any of
the shown linkers.

Figure 10: The synthesis of ddATP-Linker-II-PC-
Biotin. PC = photocleavable.



Figure 11: Schematic for capturing a DNA fragment
terminated with a ddNTP on a surface and then for
freeing the ddNTP and DNA fragment. The
dideoxynucleotide (ddNTP), which is on one end of the
Detailed Description Of The Invention 

The following definitions are presented as an aid in 
understanding this invention. 

The standard abbreviations for nucleotide bases are
used as follows: adenine (A), cytosine (C), guanine
(G), thymine (T), and uracil (U).

This invention is directed to a method for sequencing
DNA by detecting the identity of a dideoxynucleotide
incorporated to the 3' end of a DNA sequencing
fragment using mass spectrometry, which comprises:

(a) attaching a chemical moiety via a linker to
    a dideoxynucleotide to produce a labeled
    dideoxynucleotide;
(b) terminating a DNA sequencing reaction with
    the labeled dideoxynucleotide to generate a
    labeled DNA sequencing fragment, wherein
    the DNA sequencing fragment has a 3' end
    and the chemical moiety is attached via the
    linker to the 3' end of the DNA sequencing
    fragment;
(c) capturing the labeled DNA sequencing
    fragment on a surface coated with a
    compound that specifically interacts with
    the chemical moiety attached via the linker
    to the DNA sequencing fragment, thereby
    capturing the DNA sequencing fragment;
(d) washing the surface to remove any non-bound
    component;
(e) freeing the DNA sequencing fragment from
    the surface; and
(f) analyzing the DNA sequencing fragment using
    mass spectrometry so as to sequence the
    DNA.


In one embodiment, the chemical moiety is attached
via a different linker to different
dideoxynucleotides. In one embodiment, the different
linkers increase mass separation between different
labeled DNA sequencing fragments and thereby increase
mass spectrometry resolution.

In one embodiment, the dideoxynucleotide is selected
from the group consisting of 2',3'-dideoxyadenosine
5'-triphosphate (ddATP), 2',3'-dideoxyguanosine
5'-triphosphate (ddGTP), 2',3'-dideoxycytidine
5'-triphosphate (ddCTP), and 2',3'-dideoxythymidine
5'-triphosphate (ddTTP).

In different embodiments of the methods described 
herein, the interaction between the chemical moiety 
attached via the linker to the DNA sequencing 
fragment and the compound on the surface comprises a 
biotin-streptavidin interaction, a phenylboronic 
acid-salicylhydroxamic acid interaction, or an 
antigen-antibody interaction.

In one embodiment, the step of freeing the DNA
sequencing fragment from the surface comprises
disrupting the interaction between the chemical
moiety attached via the linker to the DNA sequencing
fragment and the compound on the surface. In
different embodiments, the interaction is disrupted
by a means selected from the group consisting of one
or more of a physical means, a chemical means, a
physical chemical means, heat, and light. In one
embodiment, the interaction is disrupted by
ultraviolet light. In different embodiments, the
[Chemical structure drawings not reproduced; the surviving fragments show linkers bearing a CH2NHC(O)CF3 group]


In one embodiment, a plurality of different labeled
dideoxynucleotides is used to generate a plurality of
different labeled DNA sequencing fragments. In one
embodiment, a plurality of different linkers is used
to increase mass separation between different labeled
DNA sequencing fragments and thereby increase mass
spectrometry resolution.



In one embodiment, the chemical moiety comprises
biotin, the labeled dideoxynucleotide is a
biotinylated dideoxynucleotide, the labeled DNA
sequencing fragment is a biotinylated DNA sequencing
fragment, and the surface is a streptavidin-coated
solid surface. In one embodiment, the biotinylated
dideoxynucleotide is selected from the group
consisting of ddATP-11-biotin, ddCTP-11-biotin,
ddGTP-11-biotin, and ddTTP-16-biotin.



In one embodiment, the biotinylated dideoxynucleotide
is selected from the group consisting of:

[Chemical structure drawings not reproduced]

In one embodiment, the biotinylated dideoxynucleotide
is selected from the group consisting of:

[Chemical structure drawings not reproduced]

In one embodiment, the streptavidin-coated solid 
surface is a streptavidin-coated magnetic bead or a 
streptavidin-coated silica glass. 

In one embodiment of the method, steps (b) to (e) are 
performed in a single container or in a plurality of 
connected containers. 



In one embodiment, the mass spectrometry is matrix- 
In different embodiments, the linker can comprise a
chain structure, or a structure comprising one or
more rings, or a structure comprising a chain and one
or more rings.

In different embodiments, the linker is cleavable by
a means selected from the group consisting of one or
more of a physical means, a chemical means, a
physical chemical means, heat, and light. In one
embodiment, the linker is cleavable by ultraviolet
light. In different embodiments, the linker is
cleavable by ammonium hydroxide, formamide, or a
change in pH (-log H+ concentration).

In different embodiments of the linker, the chemical 
moiety comprises biotin, streptavidin, phenylboronic 
chemical means, a physical chemical means, heat, and
light. In one embodiment, the linker is cleavable by
ultraviolet light. In different embodiments, the
linker is cleavable by ammonium hydroxide, formamide,
or a change in pH (-log H+ concentration).

In different embodiments of the labeled
dideoxynucleotide, the chemical moiety comprises
biotin, streptavidin, phenylboronic acid,
salicylhydroxamic acid, an antibody, or an antigen.



In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 
In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 




The invention provides the use of any of the labeled
dideoxynucleotides described herein in DNA sequencing
using mass spectrometry, wherein the linker increases
mass separation between different labeled
dideoxynucleotides and increases mass spectrometry
resolution.



In one embodiment, the labeled dideoxynucleotide has 



embodiment, the biotinylated moiety is a biotinylated
DNA sequencing fragment.

In one embodiment, the chemical moiety can be freed 
from the surface by disrupting the interaction 
between the chemical moiety and the compound coating 
the surface. In different embodiments, the 
interaction can be disrupted by a means selected from 
the group consisting of one or more of a physical 
means, a chemical means, a physical chemical means, 
heat, and light. In different embodiments, the 
interaction can be disrupted by ammonium hydroxide, 
formamide, or a change in pH (-log H+ concentration).

In one embodiment, the chemical moiety is attached
via a linker to another chemical compound. In one
embodiment, the other chemical compound is a DNA
sequencing fragment. In one embodiment, the linker
is cleavable by a means selected from the group
consisting of one or more of a physical means, a
chemical means, a physical chemical means, heat, and
light. In one embodiment, the channel is transparent
to ultraviolet light and the linker is cleavable by
ultraviolet light. Cleaving the linker frees the DNA
sequencing fragment or other chemical compound from
the chemical moiety which remains captured on the
surface.

The invention provides a multi-channel system which
comprises a plurality of any of the single channel
systems disclosed herein. In one embodiment, the
channels are in a chip. In one embodiment, the
multi-channel system comprises 96 channels in a chip.



[Chemical structure drawing not reproduced; the surviving fragment shows a CH2NHC(O)CF3 group]

This invention will be better understood from the
Experimental Details which follow. However, one
skilled in the art will readily appreciate that the
specific methods and results discussed are merely
illustrative of the invention as described more fully
in the claims which follow thereafter.



phenylboronic acid-salicylhydroxamic acid (Bergseid
et al. 2000) and antigen-antibody systems.

As illustrated schematically in Figure 1, DNA
template, deoxynucleotides (dNTPs) (A, C, G, T) and
biotinylated dideoxynucleotides (ddNTP-biotin) (A-b,
C-b, G-b, T-b), primer, and DNA polymerase are
combined in one tube. After polymerase extension and
termination reactions, a series of DNA sequencing
fragments with different lengths is generated. The
sequencing reaction mixture is then incubated for a
few minutes with a streptavidin coated solid phase.
Only the DNA sequencing fragments that are terminated
with a biotinylated dideoxynucleotide at the 3' end
are captured on the solid phase. Excess primers, false
terminated DNA fragments (fragments terminated at
dNTPs instead of ddNTPs), enzymes and all other
components from the sequencing reaction are washed
away. The biotinylated DNA sequencing fragments are
then cleaved off the solid phase by disrupting the
interaction between biotin and streptavidin to obtain
a pure set of DNA sequencing fragments. The
interaction between biotin and streptavidin can be
disrupted using, for example, ammonium hydroxide,
formamide, or a change in pH. The DNA sequencing
fragments are then mixed with matrix
(3-hydroxy-picolinic acid) and loaded into a mass
spectrometer to produce accurate mass spectra of the
DNA sequencing fragments. Since each type of nucleotide
has a unique molecular mass, the mass difference
between adjacent peaks on the mass spectra gives the
sequence identity of the nucleotides.
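
The last step of the paragraph above, reading the sequence from the mass gaps between adjacent peaks, can be sketched in code. This is a minimal illustration, not the procedure of this application: the residue masses are standard average values supplied here as an assumption, the function names are hypothetical, and all four terminators are assumed to carry the same tag mass so that each gap equals the mass of one added nucleotide.

```python
# Average masses (Da) of the deoxynucleotide monophosphate residues in a
# DNA strand. Assumed reference values, not taken from this document.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def call_bases(peak_masses, tolerance=3.0):
    """Read a sequence from the mass gaps between adjacent ladder peaks."""
    peaks = sorted(peak_masses)
    called = []
    for lighter, heavier in zip(peaks, peaks[1:]):
        gap = heavier - lighter
        # Pick the base whose residue mass is closest to the observed gap.
        base = min(RESIDUE_MASS, key=lambda b: abs(RESIDUE_MASS[b] - gap))
        if abs(RESIDUE_MASS[base] - gap) > tolerance:
            base = "N"  # the gap matches no base within the tolerance
        called.append(base)
    return "".join(called)


def simulate_ladder(start_mass, sequence):
    """Build a hypothetical ladder of peak masses, for testing only."""
    masses = [start_mass]
    for base in sequence:
        masses.append(masses[-1] + RESIDUE_MASS[base])
    return masses


if __name__ == "__main__":
    print(call_bases(simulate_ladder(4500.0, "GATTACA")))
```

The default tolerance of 3 Da mirrors the detector resolution quoted later in this description. With mass-tagged terminators of different masses the gaps would have to be corrected for the tag difference before matching.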
template (SEQ ID NO: 1) and 13 bp primer (SEQ ID NO: 2):

Template: 5'-ACTTTTTACTGTTCGATCCCTGCATCTCAGAGCTCGCTATTCCGAGCTTACACGT-3'

Primer: 3'-TAAGGCTCGAATG-5'
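
How the primer pairs with the template, and which bases the polymerase is then expected to add, can be checked with a short script. This is an illustrative sketch only; the helper names are hypothetical, and the primer is rewritten in the 5' to 3' direction for convenience.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def primer_site(template_5to3, primer_5to3):
    """Return the 0-based start of the primer binding site, or -1 if none."""
    return template_5to3.find(reverse_complement(primer_5to3))


def expected_extension(template_5to3, primer_5to3):
    """Bases added to the primer's 3' end, listed in synthesis order."""
    start = primer_site(template_5to3, primer_5to3)
    if start < 0:
        raise ValueError("primer does not anneal to the template")
    # Synthesis proceeds toward the 5' end of the template.
    return reverse_complement(template_5to3[:start])


TEMPLATE = "ACTTTTTACTGTTCGATCCCTGCATCTCAGAGCTCGCTATTCCGAGCTTACACGT"
PRIMER = "GTAAGCTCGGAAT"  # the 13 bp primer above, written 5' to 3'

if __name__ == "__main__":
    print(primer_site(TEMPLATE, PRIMER))
    print(expected_extension(TEMPLATE, PRIMER))
```

The extension string lists the terminators in the order in which successive ladder peaks should appear, starting with the primer extended by a single dideoxynucleotide.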



Four commercially available biotinylated
dideoxynucleotides ddATP-11-biotin, ddGTP-11-biotin,
ddCTP-11-biotin and ddTTP-11-biotin (New England
Nuclear, Boston) were used to produce the sequencing
ladder, which was generated in one tube using the
cycle sequencing procedure. It can be seen from
Figure 2 that very clean sequence peaks are obtained
on the mass spectra, with the first peak being primer
extended by one biotinylated dideoxynucleotide.
Furthermore, excess primer in the sequencing reaction
is completely removed and no false stopped peaks are
detected. The base identity of A and G can be
identified unambiguously in Figure 2. Since the mass
difference between the commercially available
ddCTP-11-biotin and ddTTP-11-biotin is one dalton and
the resolution is only within about 3 daltons in the
mass detector for DNA fragments, C and T cannot be
differentiated in Figure 2. The data show that by
capturing/releasing DNA sequencing fragments with the
biotin located on the 3' dideoxy terminators, clean
sequencing ladders that are free from any other
contaminants can be obtained. Further improvement of
the procedure requires the use of biotinylated ddTTPs
that have large mass differences in comparison to
ddCTP-11-biotin. To achieve this, ddTTP-16-biotin is
Sample preparation is performed in one tube by
executing the sequencing reactions with biotinylated
ddNTPs, regular dNTPs, DNA polymerase, and reaction
buffer. The sample is then placed in a thermocycler
for 30 cycles to create extension fragments.
Streptavidin beads are then added to the sample and
incubated to allow the biotin-streptavidin complex to
form. The beads are collected by placing the reaction
tube in a magnet and thoroughly washing them with an
ammonium acetate solution to remove all impurities
such as false stops, primers, and salts. Dilute
ammonium hydroxide solution is then used to
dissociate the biotin-streptavidin complex at 60 °C
(Jurinke et al., 1997). Once this complex is
dissociated, the solution is placed back in the
magnet to separate the beads out of solution. The
supernatant is collected, added to a matrix solution
of 3-hydroxy-picolinic acid (Aldrich), and allowed to
crystallize for analysis by a Perkin Elmer Voyager DE
MALDI-TOF mass spectrometer. The resulting spectrum
is assigned according to the positions of the various
peaks.
on the bases in the nucleotides, even with bulky
energy transfer fluorescent dyes, still allows
efficient incorporation of the modified nucleotides
into the DNA strand by DNA polymerase (Rosenblum et
al. 1997, Zhu et al. 1994). Thus, the
ddNTPs-Linker-11-biotin can be incorporated into the
growing strand by the polymerase in DNA sequencing
reactions.

Larger mass separations will greatly aid in longer
read lengths where signal intensity is smaller and
resolution is lower. The smallest mass difference
between two individual bases is over three times as
great in the mass tagged biotinylated ddNTPs compared
to normal ddNTPs and more than double that achieved
by the standard biotinylated ddNTPs, as shown in Table
1. Three 4-aminomethyl benzoic acid derivatives,
Linker I, Linker II and Linker III, are designed as
mass tags as well as linkers for bridging biotin to
the corresponding dideoxynucleotides. The synthesis
of Linker II (Figure 5) is described here to
illustrate the synthetic procedure. 3-Fluoro-4-
aminomethyl benzoic acid, which can be easily prepared
via published procedures (Maudling et al. 1983; Rolla
1982), is first protected with trifluoroacetic
anhydride, then converted to the N-hydroxysuccinimide
(NHS) ester with disuccinimidylcarbonate in the
presence of diisopropylethylamine. The resulting NHS
ester is subsequently coupled with commercially
available propargylamine to form the desired
compound, Linker II. Using an analogous procedure,
Linker I and Linker III can be easily constructed.
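
The notion of the smallest mass difference between two bases, used in the paragraph above, can be made concrete with a short calculation. This is a hedged sketch: the unmodified residue masses are standard average values supplied here as an assumption, the function names are hypothetical, and the masses in Table 1 are not reproduced.

```python
from itertools import combinations

# Average residue masses (Da) of unmodified nucleotides. Assumed reference
# values, not taken from Table 1 of this document.
UNMODIFIED = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def smallest_separation(masses):
    """Return (gap, (base1, base2)) for the two closest masses."""
    return min(
        (abs(m1 - m2), (b1, b2))
        for (b1, m1), (b2, m2) in combinations(sorted(masses.items()), 2)
    )


def resolvable(masses, resolution):
    """True if every pair of bases differs by at least the resolution."""
    return smallest_separation(masses)[0] >= resolution


if __name__ == "__main__":
    print(smallest_separation(UNMODIFIED))
    # Relative masses only: the text gives a 1 Da gap between the
    # commercial C and T terminators and about 3 Da detector resolution.
    print(resolvable({"C": 0.0, "T": 1.0}, 3.0))
```

A set of mass tags is useful to the extent that it raises the smallest pairwise gap well above the detector resolution, which is the design goal stated for Linkers I, II and III.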



released into solution by cleaving the photocleavable
linker with ultraviolet (UV) light, while the biotin
remains attached to the streptavidin that is
covalently bound to the surface. The pure DNA
fragments can then be crystallized in matrix solution
and analyzed by mass spectrometry. It is
advantageous to cleave the biotin moiety since it
contains sulfur, which has several relatively abundant
isotopes. The rest of the DNA fragments and linkers
contain only carbon, nitrogen, hydrogen, oxygen,
fluorine and phosphorus, whose dominant isotopes are
found with a relative abundance of 99% to 100%. This
allows high resolution mass spectra to be obtained.
The photocleavage mechanism (Olejnik et al. 1995,
1999) is shown in Figure 8. Upon irradiation with
ultraviolet light at 300-350 nm, the light sensitive
o-nitroaromatic carbonamide functionality on DNA
fragment 1 is cleaved, producing DNA fragment 2, PC-
biotin and carbon dioxide. The partial chemical
linker remaining on DNA fragment 2 is stable for
detection by mass spectrometry.
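
The isotope argument above can be illustrated numerically. The following sketch uses rounded natural abundances supplied here as an assumption (they are not taken from this document) and hypothetical function names; it simply shows that sulfur is the only element in the list whose dominant isotope falls well below the others.

```python
# Approximate natural abundance (%) of the most common isotope of each
# element. Rounded reference values, assumed for illustration only.
DOMINANT_ISOTOPE_ABUNDANCE = {
    "H": 99.99, "C": 98.9, "N": 99.6, "O": 99.8,
    "F": 100.0, "P": 100.0, "S": 95.0,
}


def elements_below(threshold):
    """Elements whose dominant isotope is less abundant than threshold (%)."""
    return sorted(
        element
        for element, abundance in DOMINANT_ISOTOPE_ABUNDANCE.items()
        if abundance < threshold
    )


def monoisotopic_fraction(composition):
    """Fraction of molecules made only of each element's dominant isotope."""
    fraction = 1.0
    for element, count in composition.items():
        fraction *= (DOMINANT_ISOTOPE_ABUNDANCE[element] / 100.0) ** count
    return fraction


if __name__ == "__main__":
    print(elements_below(98.0))
    print(monoisotopic_fraction({"S": 1}))
```

Under these assumed values a single sulfur atom moves about 5% of the molecules into heavier isotope peaks, which is why leaving the biotin behind on the surface narrows the isotope envelope of the released fragment.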

Four new biotinylated ddNTPs disclosed here, ddCTP-
PC-Biotin, ddTTP-Linker I-PC-Biotin, ddATP-Linker II-
PC-Biotin and ddGTP-Linker III-PC-Biotin, are shown in
Figure 9. These compounds are synthesized by
chemistry similar to that shown for the synthesis of
ddATP-Linker II-11-Biotin in Figure 6. The only
difference is that in the final coupling step
NHS-PC-LC-Biotin (Pierce, Rockford IL) is used, as
shown in Figure 10. The photocleavable linkers
disclosed here allow the use of solid phase capturable
terminators
disrupted by a means selected from the group
consisting of one or more of a physical means, a
chemical means, a physical chemical means, heat, and
light. In one embodiment, ultraviolet light can be
used to cleave the cleavable linker. Chemical means
include, but are not limited to, ammonium hydroxide
(Jurinke et al., 1997), formamide, or a change in pH
(-log H+ concentration) of the solution.



V. High density streptavidin-coated, porous silica
channel system.

Streptavidin coated magnetic beads are not ideal for
using the photocleavable biotin capture and release
process for DNA sequencing fragments, since they are
not transparent to UV light. Therefore, the
photocleavage reaction is not efficient. For
efficient capture of the biotinylated sequencing
fragments, a high-density surface coated with
streptavidin is essential. It is known that the
commercially available 96-well streptavidin coated
plates cannot provide a sufficient surface area for
efficient capture of the biotinylated DNA fragments.
Disclosed in this application is a new porous silica
channel system designed to overcome this limitation.

To increase the surface area available for solid
phase capture, porous channels are coated with a high
density of streptavidin. Ninety-six (96) porous
silica glass channels can be etched into a silica
chip (Figure 12). The surfaces of the channels are
modified to contain streptavidin as shown in Figure
13. The channel is first treated with 0.5 M NaOH,



The fragment solution is then driven out of the 
channel and into a collection plate. After matrix 
solution is added, the samples are spotted on a chip 
and allowed to crystallize for detection by MALDI-TOF 
mass spectrometry. The purification cassette is 
cleaned by chemically cleaving the biotin- 
streptavidin linkage, and is then washed and reused. 

VI. Validation of the Mass Spectrometry DNA 
Sequencing System Using Synthetic DNA Templates and 
PCR Templates Generated from Genomic DNA. 



To validate the sequencing technology disclosed here,
a synthetic DNA template can be synthesized which
mimics a portion of the human immunodeficiency virus
type 1 protease gene. The sequence of the template
(SEQ ID NO: 3) and that of the sequencing primer (SEQ
ID NO: 4) are shown below (Schmit et al. 1996):

Template: 5'-TAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATGGTCCAGGTCGTG-3'

Primer: 3'-CCAGGTCCAGCAC-5'



The tumor suppressor gene p53 can also be used as a
model system. The p53 gene is one of the most
frequently mutated genes in human cancer (O'Connor et
al. 1997). Since most of the p53 mutation hot spots
are clustered within exons 5-8, this region of the
p53 gene is selected as a sequencing target. A
synthetic sequencing template containing a portion of
the sequences from exon 7 and exon 8 of the p53 gene
and an appropriate primer can be prepared:

Template: 5'-CATGTGTAACAGTTCCTGCATGGGCGGCATGAACCCGAGG
What is claimed is: 



1. A method for sequencing DNA by detecting the
identity of a dideoxynucleotide incorporated to
the 3' end of a DNA sequencing fragment using
mass spectrometry, which comprises:

(a) attaching a chemical moiety via a linker to
    a dideoxynucleotide to produce a labeled
    dideoxynucleotide;
(b) terminating a DNA sequencing reaction with
    the labeled dideoxynucleotide to generate a
    labeled DNA sequencing fragment, wherein
    the DNA sequencing fragment has a 3' end
    and the chemical moiety is attached via the
    linker to the 3' end of the DNA sequencing
    fragment;
(c) capturing the labeled DNA sequencing
    fragment on a surface coated with a
    compound that specifically interacts with
    the chemical moiety attached via the linker
    to the DNA sequencing fragment, thereby
    capturing the DNA sequencing fragment;
(d) washing the surface to remove any non-bound
    component;
(e) freeing the DNA sequencing fragment from
    the surface; and
(f) analyzing the DNA sequencing fragment using
    mass spectrometry so as to sequence the
    DNA.




A method for sequencing DNA by detecting the 
identity of a plurality of dideoxynucleotides 
incorporated to the 3' end of different DNA 




and the compound on the surface comprises a
biotin-streptavidin interaction, a phenylboronic
acid-salicylhydroxamic acid interaction, or an
antigen-antibody interaction.

The method of claim 1 or 2, wherein the step of 
freeing the DNA sequencing fragment from the 
surface comprises disrupting the interaction 
between the chemical moiety attached via the 
linker to the DNA sequencing fragment and the 
compound on the surface.

The method of claim 5, wherein the interaction 
is disrupted by a means selected from the group 
consisting of one or more of a physical means, a 
chemical means, a physical chemical means, heat, 
and light. 

The method of claim 1 or 2, wherein the
dideoxynucleotide comprises a cytosine or a
thymine with a 5-position, or an adenine or a
guanine with a 7-position, and the linker is
attached to the 5-position of cytosine or
thymine or to the 7-position of adenine or
guanine.

The method of claim 1 or 2, wherein the step of 
freeing the DNA sequencing fragment from the 
surface comprises cleaving the linker. 

The method of claim 8, wherein the linker is 
cleaved by a means selected from the group 
consisting of one or more of a physical means, a 



The method of claim 12, wherein the linker is
selected from the group consisting of:

[Chemical structure drawings not reproduced; the surviving fragments show three linkers bearing a CH2NHC(O)CF3 group]


The method of claim 1, wherein a plurality of
different labeled dideoxynucleotides is used to
generate a plurality of different labeled DNA
sequencing fragments.

The method of claim 3 or 14, wherein a plurality
of different linkers is used to increase mass
separation between different labeled DNA
sequencing fragments and thereby increase mass
spectrometry resolution.



The method of claim 16, wherein the biotinylated
dideoxynucleotide is selected from the group
consisting of:

[Chemical structure drawings of four biotinylated dideoxynucleotides, labeled ddNTP1 through ddNTP4, not reproduced]

wherein ddNTP1, ddNTP2, ddNTP3, and ddNTP4
represent four different dideoxynucleotides.
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The method of claim 16, wherein the biotinylated 
dideoxynucleotide is selected from the group 
consisting of: 




wherein ddNTPl , ddNTP2 , ddNTP3 , and ddNTP4 
represent four different dideoxynucleotides . 
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Use of the method of claim 1 or 2 for detection 
of single nucleotide polymorphisms, genetic 
mutation analysis, serial analysis of gene 
expression, gene expression analysis , 

identification in forensics, genetic disease 
association studies, genomic sequencing, 
translational analysis , or transcriptional 
analysis. 

A linker for attaching a chemical moiety to a 
dideoxynucleotide, wherein the linker comprises 
a derivative of 4-aminomethyl benzoic acid.

The linker of claim 25, wherein the linker
comprises one or more fluorine atoms.

The linker of claim 26, wherein the linker is
selected from the group consisting of:

[structure drawing not reproduced; legible fragment: CH2NHC(O)CF3]

[structure drawing not reproduced; legible fragment: CH2NHC(O)CF3]

and

dideoxynucleotides and increases mass
spectrometry resolution.

A labeled dideoxynucleotide, which comprises a
chemical moiety attached via a linker to a
5-position of cytosine or thymine or to a
7-position of adenine or guanine.

The labeled dideoxynucleotide of claim 33, 
wherein the linker is cleavable by a means 
selected from the group consisting of one or 
more of a physical means, a chemical means, a 
physical chemical means, heat, and light. 

The labeled dideoxynucleotide of claim 34, 
wherein the linker is cleavable by ultraviolet 
light.

The labeled dideoxynucleotide of claim 33, 
wherein the chemical moiety comprises biotin, 
streptavidin, phenylboronic acid,
salicylhydroxamic acid, an antibody, or an
antigen.




38. The labeled dideoxynucleotide of claim 37,
wherein the labeled dideoxynucleotide is
selected from the group consisting of:

[Four labeled dideoxynucleotide structures, labeled ddCTP, ddTTP, ddATP, and ddGTP; drawings not reproduced]

40. The labeled dideoxynucleotide of claim 39,
wherein the labeled dideoxynucleotide is 
selected from the group consisting of: 




41. Use of the labeled dideoxynucleotide of claim 33
in DNA sequencing using mass spectrometry,
wherein the linker increases mass separation
between different labeled dideoxynucleotides and
increases mass spectrometry resolution.

47. The system of claim 46, where the interaction
can be disrupted by a means selected from the
group consisting of one or more of a physical
means, a chemical means, a physical chemical
means, heat, and light.

48. The system of claim 42, wherein the chemical
moiety is attached via a linker to another
chemical compound.

49. The system of claim 48, wherein the other 
chemical compound is a DNA sequencing fragment.

50. The system of claim 48, where the linker is
cleavable by a means selected from the group
consisting of one or more of a physical means, a
chemical means, a physical chemical means, heat,
and light.

51. The system of claim 50, wherein the channel is 
transparent to ultraviolet light and the linker 
is cleavable by ultraviolet light. 

52. A multi-channel system, which comprises a
plurality of the system of claim 42.

53. The multi-channel system of claim 52, wherein
the channels are in a chip.






54. The multi-channel system of claim 53, which 
comprises 96 channels in a chip. 



58. The method of claim 57, wherein one or more of
the different linkers is selected from the group
consisting of:

[structure drawing not reproduced; legible fragment: CH2NHC(O)CF3]

[structure drawing not reproduced; legible fragment: CH2NHC(O)CF3]

and




SEQUENCE LISTING
<110> The Trustees of Columbia University in the City of New York

<120> High-Fidelity DNA Sequencing Using Solid Phase Capturable Dideoxynucleotides And 
Mass Spectrometry 

<130> 0575/62948-PCT/JPW/ADM/BJA/AX 

<160> 6 

<170> PatentIn version 3.0

<210> 1 
<211> 55 
<212> DNA 

<213> synthetic: template 
<400> 1 

actttttact gttcgatccc tgcatctcag agctcgctat tccgagctta cacgt 55 

<210> 2 

<211> 13 

<212> DNA 

<213> synthetic: primer 

<400> 2 

taaggctcga atg 13

<210> 3
<211> 61
<212> DNA 

<213> synthetic: template 
<400> 3 

taaagctata ggtacagtat tagtaggacc tacacctgtc aacataatgg tccaggtcgt 60 
g 61 

<210> 4 

<211> 13 

<212> DNA 

<213> synthetic: primer 

<400> 4 

ccaggtccag cac 13

<210> 5 
<211> 129 
<212> DNA

<213> synthetic: template 
<400> 5 

catgtgtaac agttcctgca tgggcggcat gaacccgagg cccatcctca ccatcatcac 60 
actggaagac tccagtggta atctactggg acggaacagc tttgaggtgc atgtttgtgc 120 
ctgtcctgg 129

<210> 6 

<211> 15 

<212> DNA 

<213> synthetic: primer 

<400> 6 
ccaggacagg cacaa 